Global Meta-Analysis of Transcriptomics Studies

Authors

  • José Caldas
  • Susana Vinga
Abstract

Transcriptomics meta-analysis aims at re-using existing data to derive novel biological hypotheses, and is motivated by the public availability of a large number of independent studies. Current methods are based on breaking down studies into multiple comparisons between phenotypes (e.g. disease vs. healthy), based on the studies' experimental designs, followed by computing the overlap between the resulting differential expression signatures. While useful, in this methodology each study yields multiple independent phenotype comparisons, and connections are established not between studies, but rather between subsets of the studies corresponding to phenotype comparisons. We propose a rank-based statistical meta-analysis framework that establishes global connections between transcriptomics studies without breaking down studies into sets of phenotype comparisons. By using a rank product method, our framework extracts global features from each study, corresponding to genes that are consistently among the most expressed or differentially expressed genes in that study. Those features are then statistically modelled via a term-frequency inverse-document frequency (TF-IDF) model, which is then used for connecting studies. Our framework is fast and parameter-free; when applied to large collections of Homo sapiens and Streptococcus pneumoniae transcriptomics studies, it performs better than similarity-based approaches in retrieving related studies, using a Medical Subject Headings gold standard. Finally, we highlight via case studies how the framework can be used to derive novel biological hypotheses regarding related studies and the genes that drive those connections. Our proposed statistical framework shows that it is possible to perform a meta-analysis of transcriptomics studies with arbitrary experimental designs by deriving global expression features rather than decomposing studies into multiple phenotype comparisons.
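The pipeline described in the abstract (rank-based feature extraction per study, then a TF-IDF model to connect studies) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the function names, the toy expression values, the binary term frequency, the cosine similarity used to compare studies, and in particular the fixed `top_k` cutoff. The abstract describes the framework as parameter-free, so the cutoff is a simplification of this sketch rather than the published selection rule. Only the "most expressed" features are covered, not the differentially expressed ones.

```python
import math
from collections import Counter


def rank_product_features(expr, top_k):
    """Return the top_k genes with the smallest geometric-mean rank.

    expr maps gene -> list of expression values, one per sample.
    Rank 1 is the most expressed gene in a sample.
    top_k is a hypothetical cutoff used only in this sketch.
    """
    genes = list(expr)
    n_samples = len(next(iter(expr.values())))
    log_rank_sum = {g: 0.0 for g in genes}
    for s in range(n_samples):
        ordered = sorted(genes, key=lambda g: expr[g][s], reverse=True)
        for rank, g in enumerate(ordered, start=1):
            log_rank_sum[g] += math.log(rank)
    # Geometric mean of the ranks = exp(mean of the log ranks).
    rank_product = {g: math.exp(log_rank_sum[g] / n_samples) for g in genes}
    return set(sorted(genes, key=lambda g: rank_product[g])[:top_k])


def tfidf_vectors(study_features):
    """Weight each study's genes by inverse document frequency.

    Term frequency is binary here (a gene is a feature of a study or not),
    and idf = log(N / df), both assumptions of this sketch.
    """
    n_studies = len(study_features)
    df = Counter(g for feats in study_features.values() for g in feats)
    return {
        study: {g: math.log(n_studies / df[g]) for g in feats}
        for study, feats in study_features.items()
    }


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v[g] for g, w in u.items() if g in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Toy data (invented): three studies, five genes, two or three samples each.
studies = {
    "study1": {"A": [9, 8, 9], "B": [7, 9, 8], "C": [1, 2, 1], "D": [2, 1, 2], "E": [3, 3, 3]},
    "study2": {"A": [8, 9], "B": [1, 2], "C": [9, 8], "D": [2, 1], "E": [3, 3]},
    "study3": {"A": [1, 1], "B": [2, 2], "C": [3, 3], "D": [9, 8], "E": [8, 9]},
}

features = {name: rank_product_features(expr, top_k=2) for name, expr in studies.items()}
vectors = tfidf_vectors(features)

for a, b in [("study1", "study2"), ("study1", "study3")]:
    print(a, b, round(cosine(vectors[a], vectors[b]), 3))
```

In this toy example, study1 and study2 share a highly ranked gene and so receive a nonzero similarity, while study1 and study3 share none. A gene that appears in many studies gets a lower IDF weight, so widely shared features contribute less to a connection than rare ones.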


Similar resources

The global trend of infertility: an original review and meta-analysis

Background and aims: Infertility is one of the most important conditions of the reproductive system, and there are no reliable estimates of its global prevalence. Therefore, knowing the prevalence of infertility is important and can be effective in decision making. Methods: We systematically reviewed all published papers in the Medline and Scopus databases (1988–2010). Univariate and multivar...


Meta-analysis of Incidence of Brain Cancer Among Aircrew

Introduction: Previous studies on Brain and other Nervous System Cancers (BNSC) and aircrew have shown inconsistent results, possibly due to their relatively small sample sizes; therefore, the current study aimed to increase the precision of risk estimates. Methods: Systematic searches of PubMed and Embase for pertinent studies up to August 2016 were perfo...


Author's response to reviews. Title: Gestational tissue transcriptomics in term and preterm human pregnancies: A systematic review and meta-analysis

Title: Gestational tissue transcriptomics in term and preterm human pregnancies: A systematic review and meta-analysis


Predicting Iran's economic growth rate using meta-analysis method

One of the most important issues for governments to maintain and improve their position in the regional and global economy is the state of economic growth; one of the important issues in this situation is to predict the rate of economic growth. Proper forecasting of economic growth has very important effects on government policy and economic planning, and can help policymakers decide on future ...


Improving Reproducibility and Candidate Selection in Transcriptomics Using Meta-analysis

Transcriptomic experiments are often used in neuroscience to identify candidate genes of interest for further study. However, the lists of genes identified from comparable transcriptomic studies often show limited overlap. One approach to addressing this issue of reproducibility is to combine data from multiple studies in the form of a meta-analysis. Here, we discuss recent work in the field of...


Volume 9

Publication date: 2014